Annotating Geographical Entities

Authors

  • Alexandru Salavastru
  • Daniela Gîfu
Abstract

This paper describes a study that explores relations between geographical entities. We propose a new tool for the training and evaluation required by related annotation experiments. It builds on an annotator used for semi-automatic annotation, starting from a geography manual. We define fifteen types of entities, each with its specific subtypes: location, geo_position, geology, landform, clime, water, dimension, person, organization, URL, Timex, resource, industry, cultural and unknown. Moreover, we present the annotation conventions for three semantic relations (referential, structural and spatial), which we consider to be optimal operators for understanding a geographical manual. Part of the annotation is done manually, while the rest, such as the token, lemma and part-of-speech layers, is produced automatically. The study is intended to create a tool for the automatic detection of semantic relations in texts on geographic topics, such as geography manuals, travel guides and geography atlases, in order to help children, professors, guides and PR specialists, to be useful for tourists and, more generally, to reveal the complexity and beauty of nature.
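The annotation scheme summarized in the abstract can be pictured as a small data model. The following is only a minimal sketch under stated assumptions: the fifteen entity types and three relation types are taken from the abstract, but the class names, field names, the part-of-speech tag and the sample sentence fragment ("Danube", "Romania") are hypothetical and do not come from the paper or its tool.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional


class EntityType(Enum):
    """The fifteen entity types listed in the abstract."""
    LOCATION = "location"
    GEO_POSITION = "geo_position"
    GEOLOGY = "geology"
    LANDFORM = "landform"
    CLIME = "clime"
    WATER = "water"
    DIMENSION = "dimension"
    PERSON = "person"
    ORGANIZATION = "organization"
    URL = "URL"
    TIMEX = "Timex"
    RESOURCE = "resource"
    INDUSTRY = "industry"
    CULTURAL = "cultural"
    UNKNOWN = "unknown"


class RelationType(Enum):
    """The three semantic relations listed in the abstract."""
    REFERENTIAL = "referential"
    STRUCTURAL = "structural"
    SPATIAL = "spatial"


@dataclass
class Token:
    """Automatically produced layers: token, lemma, part of speech."""
    text: str
    lemma: str
    pos: str  # hypothetical tag set; the abstract does not name one


@dataclass
class Entity:
    """A typed span of tokens (field names are assumptions)."""
    entity_id: str
    entity_type: EntityType
    tokens: List[Token]
    subtype: Optional[str] = None  # subtypes are not listed in the abstract


@dataclass
class Relation:
    """A typed link between two annotated entities."""
    relation_type: RelationType
    source_id: str
    target_id: str


# Illustrative (invented) annotation: a river spatially related to a country.
danube = Entity("e1", EntityType.WATER, [Token("Danube", "Danube", "PROPN")])
romania = Entity("e2", EntityType.LOCATION, [Token("Romania", "Romania", "PROPN")])
rel = Relation(RelationType.SPATIAL, danube.entity_id, romania.entity_id)
```

Keeping entity and relation types as enumerations makes it easy to reject labels outside the scheme, which may be convenient when manual and automatic annotation layers are merged.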

Similar resources

Annotating Geographical Entities on Microblog Text

This paper presents a discussion of the problems surrounding the task of annotating geographical entities on microblogs and reports the preliminary results of our efforts to annotate Japanese microblog texts. Unlike prior work, we not only annotate geographical location entities but also facility entities, such as stations, restaurants, shopping stores, hospitals and schools. We discuss ways in...

Resources for Place Name Analysis

We present a new resource for annotating and visualizing the meaning of place names in natural language text, along with insights gained from analysis of manual annotations. The work addresses the issue of place name (toponym) meaning resolution, moving beyond simple named entity recognition to address the problem of grounding textual references, i.e., making a connection between the references...

Geographical localization of web domains and organization addresses recognition by employing natural language processing, Pattern Matching and clustering

The World Wide Web is growing at an increasing rate, and consequently the resources available online represent a large source of knowledge for various business and research interests. For instance, over the past years, increasing attention has been focused on retrieving information related to geographical location of places and entities, which is largely con...

PAYMA: A Tagged Corpus of Persian Named Entities

The goal in the named entity recognition task is to classify proper nouns of a piece of text into classes such as person, location, and organization. Named entity recognition is an important preprocessing step in many natural language processing tasks such as question-answering and summarization. Although many research studies have been conducted in this area in English and the state-of-the-art...

Annotating Named Entities in Consumer Health Questions

We describe a corpus of consumer health questions annotated with named entities. The corpus consists of 1548 de-identified questions about diseases and drugs, written in English. We defined 15 broad categories of biomedical named entities for annotation. A pilot annotation phase in which a small portion of the corpus was double-annotated by four annotators was followed by a main phase in which ...

Journal title:
  • Research in Computing Science

Volume 90  Issue 

Pages  -

Publication date: 2015